Efficient Memory Organization for DNN Hardware Accelerator Implementation on PSoC
نویسندگان
چکیده
The use of deep learning solutions in different disciplines is increasing and their algorithms are computationally expensive most cases. For this reason, numerous hardware accelerators have appeared to compute operations efficiently parallel, achieving higher performance lower latency. These need large amounts data feed each computing layers, which makes it necessary handle the transfers that collect information from accelerators. implementation these accelerators, hybrid devices widely used, an embedded computer, where operating system can be run, a field-programmable gate array (FPGA), accelerator deployed. In work, we present software API organizes memory, preventing reallocating one memory area another, improves native Linux driver with 85% speed-up reduces frame time by 28% real application.
منابع مشابه
A Novel and Efficient Hardware Implementation of Scalar Point Multiplier
A new and highly efficient architecture for elliptic curve scalar point multiplication is presented. To achieve the maximum architectural and timing improvements we have reorganized and reordered the critical path of the Lopez-Dahab scalar point multiplication architecture such that logic structures are implemented in parallel and operations in the critical path are diverted to noncritical path...
متن کاملA Cache-Based Hardware Accelerator for Memory Data Movements
T his dissertation presents a hardware accelerator that is able to accelerate large (including non-parallel) memory data movements, in particular memory copies, performed traditionally by the processors. As today’s processors are tied with or have integrated caches with varying sizes (from several kilobytes in hand-held devices to many megabytes in desktop devices or large servers), it is only ...
متن کاملAn Efficient Hardware Accelerator for HS1-SIV Encryption Algorithm
Data security is a major concern for everyone in today’s informational world. Encryption is the process of encoding messages or information in such a way that only authorized parties can read it. It is one of the major information security solutions. Hash Stream1-Synthetic Initialization Vector (HS1-SIV) is a recently developed and fast encryption algorithm. In this paper1, we present a hardwar...
متن کاملHardware supported efficient accelerator partitioning for workstation consolidation and virtualization
Accelerators have gained an important role in recent years. While being used primarily in the scientific community in the beginning, they are now employed in a wide range of every day applications. Accelerators can hence be viewed in the focus of machine consolidation and virtualization, offering new opportunities for cost saving and services. Although these opportunities have been discussed in...
متن کاملHardware Accelerator Approach Towards Efficient Biometric Cryptosystems for Network Security
Protecting data and its communication is a critical part of the modern network. The science of protecting data, known as cryptography, uses secret keys to encrypt data in a format that is not easily decipherable. However,most commonly secure logons for a workstation connected to a network use passwords to perform user authentication. These passwords are a weak link in the security chain, and ar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2021
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics10010094